WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCX 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 : 
C12Q 1/68, C12N 15/10, 15/64 



Al 



(11) International Publication Number: WO 99/57312 

(43) International Publication Date: 1 1 November ! 999 (1 1.1 1.99) 



(21) International Application Number: PCT/EP99/02964 

(22) International Filing Date: 30 April 1999 (30.04.99) 



(30) Priority Data: 

09/070.547 



30 April 1998 (30.04.98) 



US 



(71) Applicant (for all designated States except US): 

MAX-PLANCK-GESELLSCHAFT ZUR FORDERUNG 
DER WISSENSCHAFTEN E.V. [DE/DE]; Berlin (DE). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): CAHILL. Dolores 
[lE/DE]; Liitzelsteincr Weg 28, D-14195 Berlin-Dahlem 
(DE). BOSSOW, Konrad [DE/DE]; Vionvillestrasse 21, 
D-12167 Berlin (DE). WALTER, Gerald [DE/DE]; Marg- 
eretenstrasse 13a. D-12203 Berlin (DE). LEHRACH. Hans 
[AT/DE]; LUtzelsteiner Weg 50. D-14195 Berlin (DE). 

(74) Agent: VOSSIUS & PARTNER; P.O. Box 86 07 67. D-81634 
Munich (DE). 



(81) Designated States: AU, CA. JP. US. European patent (AT, BE. 
CH. CY. DE. DK. ES, FI. FR, GB. GR, IE. IT, LU. MC. 
NL, PT. SE). 



Published 

With international search report. 

Before the expiration of the time limit for amending the 
claims and to be republished in the event of the receipt of 
amendments. 



(54) Title: NEW METHOD FOR THE SELECTION OF CLONES OF AN EXPRESSION LIBRARY INVOLVING REARRAYING 
(57) Abstract 

~ ^fh^prSSulnv^ndon^reiatesTol libraryr The"method is characterized- 

by the step of analyzing for the expression of at least one (poly)peptide, such as a tag expressed in frame with a recombinant insert of 
said expression library, wherein said clones of said expression library are arranged in arrayed form. Screening for the expression of said 
(po]y)peptide allows for the rearraying of clones that express the recombinant insert. Upon rearraying said clones, the library will now 
comprise clones having inserts in the correct reading frame to a very high degree. Optionally, the library is additionally screened for desired 
inserts either by contacting the expressed library with a Ugand specific for the expression product or by hybridizmg or fingerprmtmg the 
inserts with a nucleic acid probe. 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCX on the front pages of pamphlets publishing international applications under the PCX. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Fmland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgium 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Paso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mexico 


UZ 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


zw 


Zimbabwe 


CI 


C6tc d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DE 


Gemiany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







wo 99/57312 



PCT/EP99/02964 



New Method for the Selection of Clones 
of an Expression Library Involving Rearraying 

BACKGROUND OF THE INVENTION 

Proteins are genomic sequence information translated into functional units, enabling 
biological processes. Initial attempts at sequencing the large and complex human 
genome were intentionally focused on expressed regions, as represented by cDNA 
repertoires (Adams et al., Nature 377 (1995). 3S-174S). Meanwhile, expressed 
sequence tags (ESTs) for most human genes have been deposited in the nucleotide 
databases (Wolfsberg et al., Nucl. Acids Res. 25 (1997), 1626-1632). However, only 
a minority of these sequences have yet been assigned a function (Strachan et al., 
Nature Genet. 16 (1997), 126-132). The most straightforward solution to this 
structure-function discrepancy seems to be the direct correlation between the 
functional status of a tissue and the expression of certain sets of genes. Technology 
is now available to approach this goal on different levels of gene expression. On the 

-transcriptional-levelr gene expression-patterns-have been-analyzed~by-hybridization 

of complex probes (DeRisi et a!., Science 278 (1997), 680-686; Schena et al.. 
Science 270 (1995), 467-470; Bernard et al.. Nucl. Acids Res. 24 (1997). 1435-1442; 
Mallo et a!.. Int. J. Cancer 74 (1997), 35-44) or sets of short oligonucleotides 
(Velculescu et al.. Science 270 (1995). 484-487) to cDNA arrays, the SAGE 
sequencing approach (Wodicka et al.. Nature Biotechnol. 15 (1997). 1359-1367) or 
hybridization to oligonucleotide arrays (Maier et al.. Drug Discovery Today 2 (1997), 
315-324). 

On the transiational level, protein extracts have been mapped at high resolution on 
two-dimensional gels (Klose et a!.. Electrophoresis 16 (1995), 1034-1059). Mass 
spectrometry analysis of protein spots was then used to obtain sequence information 
(Clauser et al., Proc. Natl. Acad. Sci. USA 92 (1995). 5072-5076). Clonal cDNA 
expression in mammalian celts and matching of the protein products to two- 
dimensional electrophoresis patterns of cellular proteins was described by Leffers et 
al. (Leffers et al., Electrophoresis 17 (1996), 1713-1719). Pooled clones from an 
ordered cDNA library were expressed by in vitro transcription/translation and 
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analyzed by two-dimensional electrophoresis (Lefkovits et al.. Appl. Theor. 
Electrophor. 5 (1995), 35-42; Behar et al., Appl. Theor. Electrophor. 5 (1995), 99- 
105; Lefkovits et al.. Appl. Theor. Electrophor 5 (1995). 43-47). 

All the prior art methods have in common that rather extensive purification and/or 
analytical steps have to be performed before the required information can be 
extracted from the biological material of interest. The art is in need of methods to 
significantly reduce the amount of work involved in the functional genome analysis. 
Such a reduction in the workload would, at the same time, naturally lead to a 
reduction of manpower requirements. Accordingly, the technical problem underlying 
the present invention was to provide a method to simplify functional genomics. The 
solution to said technical problem is achieved by providing the embodiments 
characterized in the claims. 

SUMMARY OF THE INVENTION 

The present invention relates to a novel method for the selection of clones of an 
expression library. The method is characterized by the step of analyzing for the 
expression of at least one (poly)peptide, such as a tag expressed in frame with a 
recombinant insert of said expression library, wherein said clones of said expression 
library are arranged in arrayed form. Screening for the expression of said 
(poly)peptide allows for the rearraying of clones that express the recombinant insert. 
Upon rearraying said clones, the library will now comprise clones having inserts in 
the correct reading frame to a very high degree. Optionally, the library is additionally 
screened for desired inserts either by contacting the expressed library with a ligand 
specific for the expression product or by hybridizing or fingerprinting the inserts with a 
nucleic acid probe. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to a method for the selection of clones of an expression 
library that express inserts comprising the steps of 

(a) analyzing for the expression of at least one detectable (poly)peptide 
expressed as a fusion protein with an expression product of a recombinant 
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insert of a clone of said expression library, the clones of said expression 
library being arranged in arrayed form; and 
(b) rearraying clones in which expression of said at least one (poly)peptide is 
detected. 

The temn "recombinant insert" as used in accordance with the present invention 
denotes a nucleic acid fragment which is present in the expression vector used for 
the preparation of said expression library such that it yields an open reading frame 
together with the nucleic acid fragment encoding said at least one (poly)peptide, the 
expression of said open reading frame resulting in said fusion protein. 

The term "clone of an expression library" as used in connection with the present 
invention denotes any propagable, essentially clonal biological material that contains 
recombinant genetic material and is part of an expression library. Typically, this term 
will refer to bacterial transformants but may also relate to other transformants or to 
recombinant viral material or bacteriophage. The term "expression library" is well 
understood in the art; see, for example, Sambrook et al., "Molecular Cloning, A 
Laboratory Handbook". 2"^ edition (1989), CSH Press. Cold Spring Harbor, N.Y. 
Preferably, the expression library can be induced by an inductor. Inductors are 
--known-in-the art-and-ineluderfor-examplerlPTG.-Various-type& of -expression libraries - 
are known in the art. All of these types are encompassed by the present invention. A 
preferable type of library is a library resulting from exon trapping, i.e. an exon trapped 
library, or a library made in a shuttle vector, for example, a vector which can be used 
in prokaryotic and eukaryotic systems, or in multiple prokaryotic and/or in multiple 
eukaryotic systems. Further, it is well known that expression libraries can be 
constructed from a large variety of sources. Again, the present invention envisages 
the use of all said sources in the above mentioned method. Such sources may be, for 
example, mammalian or other eukaryotic cells, tissue, bacteria, other 
microorganisms, plant, yeast, blood, or cell lines. 

The term "(poly)peptide" refers both to peptides and to polypeptides, naturally 
occumng or recombinantly. chemically or by other means produced or modified, 
which may assume the three-dimensional structure of proteins and may be post- 
translationally processed, optionally in essentially the same way as native proteins. 
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The term "fusion protein" denotes any polypeptide consisting or comprising of at least 
two (poly)peptides not naturally forming such a polypeptide. On the DNA level, the 
two or more coding sequences are fused in frame. 

The term "arrayed form" as used herein refers to any regular or non-regular form that 
can be replicated. Preferred are regular forms, in particular high density grids as 
described, for example, in Lehrach et al.. Interdisciplinary Science Reviews 22 
(1997). 37-44. 

The method of the invention displays significant advantages over prior art methods 
and is particulariy suitable for the efficient analysis of mammalian and/or plant and/or 
other eukaryotic genomes but can. of course, also be applied to the analysis of other 
expression libraries, e.g., genomic DNA expression libraries from prokaryotic or other 
microorganisms. The new method significantly reduces the background of false- 
positive clones in expression library screening. Especially when large numbers of 
clones within one or more libraries are screened, the time consuming work of 
identifying clones that eventually turn out to not have the desired biological properties 
can be avoided. This, of course, will also lead to a significant reduction of the cost 
^ 'factor in genomic and/orproteomic-anaiysis: 

In a preferred embodiment of the invention, said method further comprises the 
following steps 

(c) contacting a ligand specifically interacting with a (poly)peptide expressed by 
the insert of a clone conferring a desired biological property with said library or 
a first replica of said rearrayed library of clones and analyzing said library of 
clones for the occurrence of an interaction; and/or 

(d) carrying out a hybridization or an oligonucleotide fingerprint with a nucleic acid 
probe specific for the insert of a clone conferring said desired biological 
property with said library or said first replica or a second replica of said library 
of rearrayed clones and analyzing said library of clones for the occurrence of a 
hybridization; and 
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(e) identifying and/or characterizing clones wherein an expression of the at least 
one (poly)peptide in step (a) and/or an interaction in step (c) and/or a specific 
hybridization or an oligonucleotide fingerprint in step (d) can be detected. 

The term "desired biological property" is intended to enconnpass functional as well as 
non-functional biological properties such as structural properties. Functional 
properties may. for example, be binding properties as conferred by antibodies or 
fragments or derivatives thereof. In another altemative, said functional properties 
may relate to the turnover of target-molecules, such as provided by enzymatic 
activities. On the other hand, non-functional properties may relate to the primary 
structure of a nucleic acid that can be detected, for example, by nucleic acid 
hybridization. 

The term "ligand" as used herein comprises any type of molecule that is. by way of its 
three-dimensional structure, capable of specifically interacting with a desired 
(poly)peptide. Depending on its three-dimensional structure, said ligand may also 
interact non-specifically with (poly)peptides expressed by the recombinant inserts. A 
typical example of a ligand is an antibody or another receptor such as a hormone 
receptor. Regarding antibodies, a typical example of a non-specific interaction is a 
"cross-reaction: " - 

The term "hybridization" with a nucleic acid probe refers to specific or non-specific 
hybridization. Whether a hybridization is specific or non-specific depends on the 
stringency conditions, as is well known in the art. The term "specific hybridization" 
relates to stringent conditions. Said hybridization conditions may be established 
according to conventional protocols described, for example, in Sambrook. "Molecular 
Cloning, A Laboratory Handbook", 2""* edition (1989), CSH Press, Cold Spring 
Harbor, N.Y.; Ausubel, "Current Protocols in Molecular Biology", Green Publishing 
Associates and Wiley Interscience, N.Y, (1989); or Higgins and Hames (eds) "Nucleic 
acid hybridization, a practical approach" IRL Press Oxford, Washington DC (1985). 
An example for specific hybridization conditions is hybridization in 4 x SSC and 0.1% 
SDS at 65°C with subsequent washing in 0.1 x SSC, 0.1% SOS at 65^C. 
Alternatively, stringent hybridization conditions are, for example, 50% formamide, 4 x 
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SSC at 42°C. Non-specific conditions refer, for example, to hybridization in 4 x SSC, 
1% SDS at 50°C and washing at the sanne conditions. 

In accordance with the present invention step (c) and/or (d) can be performed with 
said library and/or a first replica and/or a second replica and/or a further replica of 
said library. If said library or said first or second or further replica is used in two 
different steps, any material added during the step (a) and/or (c) which may interfere 
with the subsequent step(s) may. optionally, be removed prior to the performance of 
the subsequent step, preferably according to conventional protocols. 

The term "identifying clones" comprises all types of identification steps suitable to 
identifying the clone of interest. For example, clones may be identified by visual 
means, for example, if the (poly)peptide expressed as a fusion protein with the 
recombinant insert is Green Fluorescent Protein and the ligand or the probe are 
labeled with a visually detectable label, e.g., alkaline phosphotase, horseradish 
peroxidase, or FITC. Furthermore, positive clones may be identified by the blue/white 
selection which is well known in the art. Alternatively, if the nucleic acid probe is 
marked with a radioactive label, exposure to an X-ray film may help identifying the 
desired clone. The clones may also be identified using mass spectrometry. 



The term "oligonucleotide fingerprinting" describes generating a sequence 
dependent, reproducible, statistically significant pattern or fingerprint of the sequence 
obtained by analyzing the hybridization pattern (hybridization/no hybridization) 
obtained on hybridizing a number of oligonucleotides onto the nucleic acid, preferably 
DNA, 

A particular advantage of this embodiment of the present invention is that the 
investigator has the choice to select between a nucleic acid probe and a ligand for 
screening his library for the desired clones. The combination of steps (a), (b). (c) and 
(d) will further enhance the reliability of the method of the invention for identifying the 
actually desired clones. Surprisingly, it could be shown in accordance with the 
invention, that, upon the original spotting of transformants in an array, and the 
subsequent growth of colonies, said detectable (poly)peptide can still be detected 
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without disturbance of the array structure. This holds also true if the colonies have 
been cultivated for about 18 hours. 

As regards the (poly)peptide expressed as a fusion protein with a recombinant insert 
of a clone of said expression library, it is to be noted that the present invention 
envisages the use of one or more of said (poly)peptides incorporated into said fusion 
protein. As is apparent from the appended examples, fusion of the (poly)peptide to 
the N-terminus allows for the detection of inserts that are expressed in frame since, 
as a rule, inserts which are not in frame with the N-terminal (poly)peptide will be 
rapidly degraded within the cytoplasm. On the other hand, the fusion of said 
(poly)peptide to the C-terminus and detection of said (poly)peptide allows for the 
selection of full-length inserts. Also, the present invention envisages the combination 
of one or more (poly)peptides fused to the N-terminal and C-terminal end of the 
insert. 

It is to be noted that prior to carrying out steps (a) to (e) the clones should present 
the biological material to be tested for in an accessible form. If the clones are, for 
example, bacterial transformants. said transformants would preferably have to be 
lysed. Such lysis methods are well known in the art. 



The application of computer-related technology with the method of the invention 
allows for the fact that screening needs to be done only once for a library. This is 
because data produced for individual clones by a later analysis, e.g. sequencing, can 
be related back to this screening. Accordingly, a rapid transition from an expression 
library such as a cDNA library to a protein library has become possible. This creates 
a direct link between a gene catalogue and a functional protein/(poly)peptide 
catalogue. In addition to the above, a repeated screening of or a prolonged screening 
reaction may further enhance the chance of excluding false-positive clones. 

In accordance with the present invention the method may also be used to 
characterize already known nucleic acid molecules. 

In a further preferred embodiment of the method of the invention, said (poly)peptide 
expressed as a part of a fusion protein with said expression product of said 
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recombinant insert is an antibody or a fragment or derivative thereof, a tag, an 
enzyme, or a phage protein or fragment thereof, or a fusion protein. 

Methods for detecting any embodiment of the above specified (poly)peptide are well 
known in the art or can be devised by the person skilled in the art without further ado. 
For example, antibodies can be detected by anti-antibodies which are detectably 
labeled. As regards the antibody fragments or derivatives thereof, these may include 
F(ab')2. Fab, Fv or scFv fragments; see, for example, Harlow and Lane, "Antibodies, 
A Laboratory Manual". CHS Press (1988), Cold Spring Harbor, N.Y. Further, tags 
may be detected according to conventional methods. The same holds true for 
enzymes which may be detected, for example, by reacting the same with a specific 
substrate and detecting, for example, a color reaction, or by using a detectably 
labeled antibody specific for said enzyme. Antibodies may also be used to detect 
phage or fragments thereof. Labels for antibodies are also well known in the art and 
include alkaline phosphotase (ATTPPHOS), CSPD. horseradish peroxidase, FITC, 
and radioactivity. Also, mass spectrometry can be used for detecting any 
embodiment of the above specified (poly)peptide. 

In a further preferred embodiment of the method of the invention, said analysis for 
— the~expression-of~a-(poly-)peptide- -in-step-(-a)~iS' effected -by- -eontaeting~a-ligand — 
different from the ligand of step (c) that specifically interacts with said (poly)peptide 
and analyzing said library of clones for a specific interaction to occur. The ligand 
used in step (a) may be the same class of ligand that is used in step (c). However, 
the actual molecular structure of the ligand should be different in both steps in order 
to be able to differentiate between the two ligands. 

In an additional preferred embodiment of the method of the invention, said analysis 
for the expression of a (poly)peptide in step (a) is effected by visual means. 

Advantageously, the expression of said (poly)peptide can be detected by visual 
means such as by fluorescence, bioluminescence or phosphorescence. The 
corresponding signals may be stored by photographic means which may be attached 
to a computer unit. The corresponding signals may be imaged using a high resolution 
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CCD detection system, saved and stored on computer as image files and analyzed 
using custom written software to score positive clones. 

It is most preferred that said visual means employ mass spectrometry. For example, 
here mass spectrometry analysis of the arrayed proteins allows the use of the protein 
arrays as a bridge to link DNA, mRNA, and/or complex hybridization results to 2-D- 
PAGE results. This is done by generating mass spectra of the arrayed proteins (e.g. 
on a chip, a mass spectrometry target or a matrix), and comparing these mass 
spectra with mass spectra generated from spots on 2-D gels. Using this approach, 
the mRNA repertoire of a cell (via the cDNA library) may be studied as the first level 
of gene expression, which most directly reflects gene activity, and may be related to 
proteome analysis which is the analysis of the protein complement of a cell, tissue, 
plant, microorganism and/or organism. 

Curently. the isolated proteins from 1-D and 2-D gels are identified in sequence 
databases using mass spectrometry. Clearly, however, this is limited to the few 
known proteins. Advantageously, this limitation is overcome by the concept of the 
present invention, namely that each protein, expressed by the clones of the 
expression libraries, is specified by a minimal set of structural information, which is 
"destghated' "minimal' protein'^identifier"~(MPI).—^^^ 

combined with additional' structural data, may be optimized in two ways, for 
unambiguous protein identification and for high throughput determination by mass 
spectrometry. 

Once recorded, MPls facilitate tracing gene products in biological samples, simply by 
comparing the measured data. In this way. protein recognition is independent of 
whether the protein is "known" (i.e. present in the current databases) or "unknown" 
(i.e. not present in the current databases). These spectra can be used to identify 
spectra subsequently generated from the analysis of protein from other sources, e.g. 
such as from separated proteins from 1-D and 2-D electrophoresis gels. 

This provides a bridge that connects the proteins characterized by 2-D 
electrophoresis, with their coresponding mRNAs and genes (cDNAs). All MPls 
collected from 2-D gels are compared by computer-based methods {in silico) with the 
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MPIs obtained from the recombinant protein library, and vice-versa. Thereby, 
thousands of biologically active gene products can be linked to their genes. This 
linkage is independent of any sequence information and, therefore, also attractive for 
functional proteome analysis of other organisms. 

Another advantage of the strategy of the present invention, compared to current 
strategies, is that protein identification becomes more reliable because mass 
spectrometric data are compared with mass spectrometric data, and not with data 
predicted from DNA or protein sequences. Major shortcomings of the latter approach 
are that substrate dependent protease performance, peptide solubility, and final 
signal suppression in the mass spectrometric analysis are not considered. 

Furthermore, the protein arrays of the present invention allow exploring mass 
spectrometric data of thousands of different proteins taken from 2-D gels by using 
their recombinant homologues labeled with stable-isotopes. In addition, it provides an 
immortal source for generating cDNA microarrays to be used to profile mRNA levels 
by complex hybridization. 

In another preferred embodiment of the method of the invention, said biological 
-property- is-speeifieity- for-a-eelh-a-tissue^ -or-the~developmental staga of -a cell-or-a — 
tissue, a microorganism, preferably a bacterium, a plant or an organism. 

In this preferred embodiment of the invention, specific comparisons can be made that 
provide the investigator with information, for example, with respect to the 
developmental status of a cell, a tissue, or an organism, or the specificity of a cell or 
a tissue, for example, with respect to its origin. This can be done by comparing two 
tissues from different origins for the presence of certain marker proteins. For 
example, with respect to the developmental status of an organism expression profiles 
of a 6-day old mouse embryo arrayed cDNA expression library and a 9-day old 
mouse embryo arrayed cDNA expression library may be compared to identify and 
characterize differentially expressed genes, thereby elucidating proteins expressed at 
different stages of development. 
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In a further preferred embodiment of the method of the invention, said cell or tissue is 
a normal cell or tissue, a diseased ceil or tissue, or a pretreated cell or tissue. 

The term "pretreated" as used in combination with cell or tissue is intended to mean 
that said cell or tissue has been exposed to a drug, an activator or a ligand etc. Said 
pretreatment will have, as a rule, affected the cellular pathways and optionally 
resulted in at least one phenotypic change as compared to a not pretreated cell. It is 
envisaged that said at least one phenotypic change is detected using the method of 
the invention. Also, it is expected that diseased tissue or cells display phenotypic 
differences as compared to healthy tissues or cells that can be detected with the 
method of the invention. 

In another preferred embodiment of the method of the invention, said clones are 
bacterial transformants, recombinant phage, transformed or transfected mammalian 
cells, insect, fungal, yeast or plant cells. 

Bacterial transformants are preferably transformed E. coli cells; recombinant phage is 
preferably derived from M13 or fd phage; transformed or transfected mammalian 
cells may be Hela or COS cells. As regards insect ceils. Spodoptera frugiperda or 
-Drosophila—melanogasteF -cells - are - preferred.- -Preferred— fungal— cells- comprise— 
Aspergillus cells whereas said yeast cells are preferably derived from Pichia pastoris 
or Saccharomyces cerevisiae. It is to be noted that the terms "transformed" and 
"transfected" are used interchangeably in accordance with this invention. 

In the case that said bacterial transformants are transformed E. coli cells, it is most 
preferred that E. coli SCSI cells as described in the Examples, infra, are used, in 
another most preferred embodiment, the E. coli cells are transformed with a library 
cloned into a vector allowing an inducible expression, preferably also expressing a 
tag as part of said fusion protein, preferably vector pQE-30NST as described in the 
Examples, infra. However, the person skilled in the art in the art is well aware of the 
structural and/or functional features of the E. coli cells and/or vectors as described in 
the Examples such that any E. coli cells and/or vectors displaying essentially the 
same structural and/or functional features are encompassed by the present 
invention. 
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Another preferred embodiment of the invention relates to a method, wherein said 
arrayed form has substantially the same format in steps (a) to (d). 

This embodiment of the invention is particularly useful since it allows for the 
production of replicas from one master plate and the comparison of results on a 1:1 
scale. On the other hand and less preferred, the arrayed form may have a different 
format such as a different scale in at least two of steps (a) to (d) as long as the 
unambiguous relation of clones on the various replicas is still possible. 



In'^a further preferred embodiment of the method of the invention, said arrayed form is 
a grid form. 

The grid should, in accordance with the discussion herein above, preferably allow for 
the high density array of clones of the expression library. It should further preferably 
have the format of grids that have been described in Lehrach, loc. cit. 

In a most preferred embodiment of the method of the invention, said grid has the 
dimensions of a microtiter plate, a silica wafer, a chip, a mass spectrometry target or 
— a-matrix: — 

Using these dimensions, conventional laboratory material can be employed in the 
process of the invention. Additionally, these dimensions allow for the convenient 
analysis of a large number of clones on small scale equipment. 

In another preferred embodiment of the method of the invention, said clones are 
affixed to a solid support. 

The solid support may be flexible or inflexible. This embodiment in particular allows 
for the convenient storage and transport of the arrayed clones of the expression 
library. A particulariy preferred embodiment refers to freeze dried clones that are 
affixed to said solid support. 
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A further preferred embodiment of the method of the invention relates to a method 
wherein said solid support is a filter, a membrane, a magnetic bead, a silica wafer, 
glass, metal, a chip, a mass spectrometry target or a matrix. 

As regards filters or membranes, it is particularly preferred that they are produced 
from PVDF or Nylon. As regards filters or membranes, it is particularly preferred that 
DNA or DNA-containing clones are spotted/gridded/grown on Nylon membrane filters 
(for example, Hybond N+. Amersham) as this has a high DNA binding capacity and 
that proteins or protein-expressing clones are spotted/gridded/grown on 
polyvinylidene difluoride (PVDF) membrane filters (for example, Hybond PVDF, 
Amersham) as this has a high protein binding capacity. 

In a further preferred embodiment of the method of the invention, at least one of said 
ligands is a (poly)peptide, a phage or a fragment thereof, blood, serum, a toxin, an 
inhibitor, a drug or a drug candidate, a non-proteinaceous or partially proteinaceous 
receptor, a catalytic polymer, an enzyme, a nucleic acid, a PNA. a virus or parts 
thereof, a cell or parts thereof, an inorganic compound, a conjugate, a dye, a tissue, 
or a conjugate comprising said ligand. 

— Aecordinglyrthe^ligand can-be-of-a-variety-of^naturesrlmportantiyr the various-types— 
of ligands can be detected directly or indirectly and, thus, allow the identification of 
the desired clones. 

In another preferred embodiment of the method of the invention, said (poly)peptide is 
an antibody or a fragment or derivative thereof, a hormone or a fragment thereof or 
an enzyme or a fragment or derivative thereof. 

The term "fragment or derivative thereof, as used hereinabove, is intended to mean 
that antibodies, hormones or enzymes can be modified such as by deletion of certain 
parts thereof but essentially maintain their capacity to function as a ligand. 

The above preferred (poly)peptides are especially versatile, easy to handle and can 
be provided in large different numbers. 
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In a further preferred embodiment of the method of the invention, said interaction in 
step (c) is a specific interaction. 

An example of this situation is the case where an antibody binds specifically to one 
epitope or (poly)peptide sequence, for example, the anti-histidine antibody binds 
specifically the 6x-histidine tag. 5x-histidine tag, RGS-6x-histidine tag or to an 
epitope which is only found on one protein. 

In an additional preferred embodiment of the method of the invention, said interaction 
in step (c) is an unspecific interaction. 

An example of this situation is the case where an antibody binds non-specifically to 
epitopes which are not coded from identical DNA sequences but share similar three- 
dimensional structure, charge etc.. and can be present on different proteins. As could 
be demonstrated in accordance with the present invention, an application of this 
invention can be to determine the specificity or cross-reactivity of ligands such as 
antibodies. The detection of antibody cross-reactivities on protein microarrays is not 
surprising as antibodies are not usually tested against whole libraries of proteins. The 
method of the present invention for screening antibodies against arrays of potential 
^antigens to deteGt-Gommon-epitopes-may-be-partiGularly-important-for-reagents that 
are to be used for immunohistochemistry or physiological studies on whole cells or 
tissues, where they face batteries of different structures. Alternatively or additionally, 
antibodies with no known antigen specificity (e.g., lymphoma proteins) can be 
screened for binding to a highly diverse repertoire of protein molecules. As all of 
these proteins are expressed from isolated clones of arrayed cDNA libraries, the 
corresponding inserts can easily be sequenced to identify antigen-encoding genes, it 
is envisaged in accordance with the present invention to use the method for 
characterizing the binding and/or non-specificity of antibodies, serum, etc.. for 
homology studies on protein families, and/or for defining binding domains and 
epitopes. Furthermore, the technique is not limited to antigen-antibody screening but 
may be applied to any ligand-receptor system. 
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In another preferred ennbodinrient of the nnethod of the invention, said hybridization in 
step (d) occurs under stringent conditions. It is alternatively preferred that said 
hybridization in step (d) occurs under non-stringent conditions. 

With respect to the significance and applications of the stringent/non-stringent 
hybridizations, essentially the same applies as was set forth in connection with the 
discussion of the specific/unspecific interactions. 

In a particularly preferred embodiment of the method of the invention, said tag is c- 
myc, His-tag, FLAG, alkaline phosphatase. EpiTag™. V5 tag, T7 tag. Xpress"^" tag or 
Strep-tag, a fusion protein, preferably GST. cellulose binding domain, green 
fluorescent protein, maltose binding protein or lacZ. In accordance with the invention, 
two or more tags may be comprised by the fusion protein. 

The expression library employed in the method of the invention may be constmcted 
from a variety of sources. For example, it may be a genomic library or an antibody 
library. Preferably said library of clones comprises a cDNA library. 

The arrayed form is preferably generated using an automated device. 



In a particular preferred embodiment of the method of the invention, said arrayed 
form of said library and/or said replicas is/are generated by a picking robot and/or 
spotting robot and/or gridding robot. 

Another preferred embodiment of the present invention relates to a method further 
comprising sequencing the nucleic acid insert of said desired clone. Sequencing of 
said clone will, in many cases, provide the ultimately desired information obtainable 
with the method of the invention. Protocols for sequencing DNA or RNA are well 
known in the art and described, for example, in Sambrook. loc. cit. 

In a final preferred embodiment of the method of the invention, the method comprises 
identifying the (poly)peptide encoded by the insert of the desired clone. 
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Identification of said (poly)peptide expressed from the desired clone can be effected 
by a variety of methods. Such methods are known inter alia, as standard biochemical 
methods, such as affinity chromatography, SDS-PAGE. ELISA. RIA. etc. Once the 
(poly)peptide has been sufficiently characterized, a corresponding chemical 
component may be devised for pharmaceutical applications, e.g. by peptidomimetics. 

The invention also relates to a method of producing a phamiaceutical composition 
comprising fomiuiating the insert, optionally comprised in a vector or the expression 
product of an insert of a clone conferring a desired biological property, said insert or 
expression product being identified and/or characterized in accordance with the 
method of the invention disclosed herein above. 

Further, the invention relates to a pharmaceutical composition produced by the 
method of the invention. 

The pharmaceutical composition of the present invention may further comprise a 
pharmaceutically acceptable carrier. Examples of suitable pharmaceutical carriers 
are well known in the art and include phosphate buffered saline solutions, water, 
emulsions, such as oil/water emulsions, various types of wetting agents, sterile 
"^olutioris'~etc.~'ColTip'os1tidns~cU^ 
known conventional methods. These pharmaceutical compositions can be 
administered to the subject at a suitable dose. Administration of the suitable 
compositions may be effected by different ways, e.g., by intravenous, intraperitoneal, 
subcutaneous, intramuscular, topical or intradermal administration. The dosage 
regimen will be determined by the attending physician and clinical factors. As is well 
known in the medical arts, dosages for any one patient depends upon many factors, 
including the patient's size, body suri'ace area, age. the particular compound to be 
administered, sex. time and route of administration, general health, and other drugs 
being administered concurrently. Generally, the regimen as a regular administration 
of the pharmaceutical composition should be in the range of 1 ^jg to 10 mg units per 
day. If the regimen is a continuous infusion, it should also be in the range of 1 pg to 
10 mg units per kilogram of body weight per minute, respectively. Progress can be 
monitored by periodic assessment. Dosages will vary but a preferred dosage for 
intravenous administration of DNA is from approximately 10^ to 10^^ copies of the 
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DNA molecule. The compositions of the invention may be administered locally or 
systemically. Administration will generally be parenterally, e.g., intravenously; DNA 
may also be administered directly to the target site. e.g.. by biolistic delivery to an 
internal or external target site or by catheter to a site in an artery. Preparations for 
parenteral administration include sterile aqueous or non-aqueous solutions, 
suspensions, and emulsions. Examples of non-aqueous solvents are propylene 
glycol, polyethylene glycol, vegetable oils such as olive oil, and injectable organic 
esters such as ethyl oleate. Aqueous carriers include water, alcoholic/aqueous 
solutions, emulsions or suspensions, including saline and buffered media. Parenteral 
vehicles include sodium chloride solution, Ringer's dextrose, dextrose and sodium 
chloride, lactated Ringer's, or fixed oils. Intravenous vehicles include fluid and 
nutrient replenishers. electrolyte replenishers (such as those based on Ringer's 
dextrose), and the like. Preservatives and other additives may also be present such 
as. for example, antimicrobials, anti-oxidants, chelating agents, and inert gases and 
the like. 

It is envisaged by the present invention that the various inserts, optionally comprised 
in vectors are administered either alone or in any combination using standard vectors 
and/or gene delivery systems, and optionally together with a pharmaceutically 
^acceptable-carrier or~excipientr SubsequentiO"administration7 said~polynucleotides"or^ 
vectors may be stably integrated into the genome of the subject. On the other hand, 
viral vectors may be used which are specific for certain cells or tissues and persist in 
said cells. Suitable pharmaceutical carriers and excipients are well known in the art. 
The pharmaceutical compositions prepared according to the invention can be used 
for the prevention or treatment or delaying of different kinds of diseases, which are. 
for example, related to B-cell and/or T cell related immunodeficiencies and 
malignancies, any malignant and non-malignant cells/tissues, and/or between 
different strains of organisms, such as pathogenic microorganisms and non- 
pathogenic microorganisms, disease-resistant and/or virus-resistant plants and non- 
resistant, and/or between any two strains, species, etc. of cells, tissues, organisms, 
microorganisms, plants, viruses, phages, bacteria, yeast etc. 

Furthermore, it is possible to use a pharmaceutical composition of the invention 
which comprises the polynucleotide or vector of the invention in gene therapy. 
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Suitable gene delivery systems may include liposomes, receptor-mediated delivery 
systems, naked DNA, and viral vectors such as herpes viruses, retroviruses, 
adenoviruses, and adeno-associated viruses, among others. Delivery of nucleic acids 
to a specific site in the body for gene therapy may also be accomplished using a 
biolistic delivery system, such as that described by Williams (Proc. Natl. Acad, Sci. 
USA 88 (1991). 2726-2729). 

It is to be understood that the introduced inserts and vectors express the gene 
product after introduction into said cell and preferably remain in this status during the 
lifetime of said cell. For example, cell lines which stably express the polynucleotide 
under the control of appropriate regulatory sequences may be engineered according 
to methods well known to those skilled in the art. Rather than using expression 
vectors which contain viral origins of replication, host cells can be transformed with 
the polynucleotide of the invention and a selectable marker, either on the same or 
separate plasmids. Following the introduction of foreign DNA, engineered cells may 
be allowed to grow for 1-2 days in an enriched media, and then are switched to a 
selective media. The selectable marker in the recombinant plasmid confers 
resistance to the selection and allows for the selection of cells having stably 
integrated the plasmid into their chromosomes and grow to form foci which in turn 
~can~Be~doned"and'expanded~in^ celHines.-Such-engineered-celHines-are-also— 
particularly useful in screening methods for the detection of compounds involved in. 
e.g.. B-celiyT-cell interaction. 

A number of selection systems may be used, including but not limited to the herpes 
simplex virus thymidine kinase (Wigier. Cell 11(1977). 223). hypoxanthine-guanine 
phosphoribosyltransferase (Szybalska. Proc. Natl. Acad. Sci. USA 48 (1962). 2026). 
and adenine phosphoribosyltransferase (Lowy. Cell 22 (1980), 817) in tk*. hgprt" or 
aprt* cells, respectively. Also, antimetabolite resistance can be used as the basis of 
selection for dhfr, which confers resistance to methotrexate (Wigier, Proc. Natl. Acad. 
Sci. USA 77 (1980). 3567; O'Hare, Proc. Natl. Acad. Sci. USA 78 (1981). 1527), gpt, 
which confers resistance to mycophenolic acid (Mulligan, Proc. Natl. Acad. Sci. USA 
78 (1981), 2072); neo, which confers resistance to the aminoglycoside G-418 
(Colberre-Garapin. J. MoL Biol. 150 (1981). 1); hygro. which confers resistance to 
hygromycin (Santerre. Gene 30 (1984), 147); or puromycin (pat, puromycin N-acetyl 
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transferase). Additional selectable genes have been described, for example, trpB, 
which allows cells to utilize indole in place of tryptophan; hisD, which allows cells to 
utilize histinol in place of histidine (Hartman, Proc. Natl. Acad. Sci. USA 85 (1988), 
8047); and ODC (ornithine decarboxylase) which confers resistance to the ornithine 
decarboxylase inhibitor. 2-(difluoromethyl)-DL-ornithine, DFMO (McConlogue, 1987, 
In: Current Communications in Molecular Biology, Cold Spring Harbor Laboratory 
ed.). 

The invention also relates to a kit comprising at least two replicas of expression 
libraries as referred to herein above affixed to a solid support. The kit of the invention 
is particularly suitable for carrying out the method of the invention. The various types 
of possible and preferred solid supports have been defined herein above. Preferably, 
the kit of the present invention further comprises at least one ligand as defined 
hereinabove. 

The components of the kit of the invention may be packaged in containers such as 
vials, optionally in buffers and/or solutions. If appropriate, one or more of said 
components may be packaged in one and the same container, 

TlTe'lioWrfientF^iteci~ilT^th^~pTeserif specificatioFT^re^ hefewitb^in'^^^^^ 
reference. 

The figures show: 

Figure 1 

RGS^His detection of protein expression clones with the RGS-His antibody on a high- 
density filter. A filter displaying 27,648 clones, arrayed in duplicate, was screened 
with the RGS-His antibody to detect clones expressing His6-tagged recombinant 
proteins. 

Figure 2 

Identification of GAPDH expression clones, (a) Screening of a DNA filter representing 
27,648 cDNA clones, arrayed in duplicate, with a GAPDH-specific DNA probe, (b) 
Screening of an identical protein filter representing the same clones as in (a) with an 
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anti-GAPDH antibody. Corresponding sections of filters are shown. 



Figure 3 

Venn diagrams showing the categories of clones identified by different probes and 
antibodies. Circles represent sets of clones identified by individual probes. Clones in 
intersections were detected by multiple probes. 

Figure 4 

Sequence alignments of sequences of GAPDH (a) and HSPQOa (b) clones. The open 
reading frames of GAPDH and HSPQOa are shown as open boxes. Each line 
indicates the length of the sequence expected to be present in the respective clone, 
with thicker sections showing the fragment actually sequenced and aligned to the full- 
length mRNA sequence. The letters A-E refer to the categories in Fig. 3. 

Figure 5 

Protein products of clones detected by RGS-His and/or specific antibodies against 
GAPDH (a) or HSP90a (b). Shading and numbers in the boxes across the top 
indicate signal intensities on high-density filters. Whole cellular proteins were stained 
with Coomassie blue. Clone categories are the same as in Fig. 3. 



Figure 6 

Transfer stamp for protein solution transfer from 384-well microtitre plates to PVDF 
membranes. 16 individual, spring-loaded, stainless steel pins are mounted into a 
POM (Polyoxymethylene, Polyformaldehyde, Polyacetale) corpus. The pin-to-pin 
distance is 4,5 mm. The blunt end tip size was measured to 250 pm. 

Figure 7 

Sensitivity of specific protein detection on microarrays. Equimolar concentrations 
(100 pmol/pl - 1 fmol/pl) of purified human GAPDH (duplicates 19-24 and 43-48). 
human bHSPQOalpha (duplicates 7-12 and 31-36) and rat bBlP (duplicates 13-18 
and 37-42) were spotted (5x5 nl) in two identical series of duplicates and detected 
using a monoclonal anti-GAPDH antibody. A: Spot array on PVDF filter membrane 
(1.9 X 1,9 cm holding 128 samples. 4x4 vertical duplicate spotting pattern, black 
duplicate guide spots, counting of duplicates as indicated); B: Relative intensities of 
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means of duplicates in A (guide spots excluded), indicating numbering of duplicates 
(as in A), name and amounts of protein spotted and detection threshold. 

Figure 8 

High-throughput expression of RGS-Hise-tagged fusion proteins from clones of the 
arrayed hEx1 library as detected on a microarray using the monoclonal antibody 
RGS-His (Qiagen). Crude, filtered lysates of 92 clones were spotted from a 96-weil 
microtitre plate, including 4 wells with control proteins (H1. vector pQE-30NST 
without insert; H2, bHSP90alpha, clone N15170, vector pQE-BH6; H3. GAPDH, 
clone D215, vector pQE-30NST; H4, bBlP. vector pQE-BH6). A: Reproducibility of 
detection as diagonal of relative intensities of duplicates; Insert: Spot array-on PVDF 
filter membrane (as in Fig. 7, lower guide spots doubled for orientation); B: 
Diagramme as in Fig. 7, indicating (+) or (-) Reading Frames of inserts if known 
(specificity threshold arbitrarily set to 7,500 relative intensity). 

Figure 9 

Specificity testing of three monoclonal antibodies on identical microarrays of RGS- 
Hise-tagged fusion proteins expressed from clones of the arrayed hExl library as in 
Fig.8. A: monoclonal anti-GAPDH (H3, GAPDH, clone D215, vector pQE-30NST); B: 
~mWoi:lonal'anti-HSP90'alpha-(H2; ^ 
H10, 60S ribosomal protein L18A; H3, GAPDH, clone D215, vector pQE-30NST); C: 
monoclonal anti-alpha tubulin (F9 and A4, RF(+) alpha tubulin clones; C7, RF(-) 
alpha tubulin clone; B1 and B12. unknown genes; H3, GAPDH. clone D215. vector 
pQE-30NST; G1, RF(-) beta tubulin clone; E5, RPL3 ribosomal protein L3; H10. 
RPL18A ribosomal protein L18A; E6 and D8, RPS2 ribosomal protein S2; F7. RPS3A 
ribosomal protein S3A; E3, RPS25 ribosomal protein S25); specificity threshold 
arbitrarily set to 25,000 relative intensity. 
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Table 1 

Evaluation of different screening options for the hEx1 cDNA expression library. Clone 
categories are as in Fig. 3. Numbers in brackets represent second screenings. 

Table 2 

Evaluation of different screening options for the hExl cDNA expression library. Clone 
categories are as in Fig. 2. Numbers in brackets represent second screenings. 

The following examples are intended to illustrate but not to limit the invention. While 
they are typical of those that might be used, other procedures known to those skilled 
in the art may alternatively be used. 

Example 1: Construction of an arrayed human cDNA expression library 

A directionally cloned human fetal brain cDNA library (hExl) was 
constructed in pQE 30NST, a vector for IPTG-inducible expression of 
His6-tagged fusion proteins. pQE30-NST was constructed from pQE-30 
(Qiagen), a pBR322-based expression vector that carries a phage T5 
promoter and^two-lac-operators for IPTGHnducible recombinant protein- 
expression as follows; in the first step. pQE-30N was generated by 
inserting a synthetic oligonucleotide carrying a Bglll and a NotI site into 
the unique PstI site of pQE-30. In subsequent steps, an SP6 promoter 
oligonucleotide carrying an SP6 promoter was inserted between the 
BamHI and the Sail site of pQE-30N. followed by insertion of a second 
oligonucleotide carrying a T7 promoter between the Hindlll and the NotI 
site. The resulting vector, pQE-30NST, can be used for cloning of 
cDNAs with Sail and NotI overhangs. The insert can be transcribed in 
vitro in sense direction using SP6 RNA polymerase and in antisense 
direction using T7 RNA polymerase. 

An average insert size of about 1.4 kb was obtained by PCR analysis of 
14 clones. 

E. coli SCS (Stratagene) carrying pSE111 was used as the host strain 
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to construct this expression library. pSE111 was constructed from 
pSBETc (Schenk et al., BioTechniques 19(2) (1995). 196-198). 

pSBETc is a pACYC177-based expression vector that carries the argU 
gene, a kanamycin resistance gene and a T7 RNA polymerase 
promoter site for recombinant protein expression {Schenk et al., 
BioTechniques 19 (1995). 196 ff.). The helper plasmid pSEIH carries 
the lac repressor gene and the argU (dnaY) gene encoding a rare tRNA 
recognizing AGA and AGG arginine codons (Brinkmann et al.. Gene 85 
(1989), 109-1 14) and was constructed from pSBETc in two steps. 

An Xmnl-EcoRV fragment, nucleotide position 2041-2521, was excised 
from pSBETc to remove the T7 promoter region. 

A 1.2 kb EcoRI fragment containing the laclQ gene was excised from 
plasmid pVHI (Haring et a!.. Proc. Natl. Acad. Sci. USA 82 (1985). 
6090-6094) and inserted into the unique EcoRI site of the plasmid 
resulting from step (i). Plasmids of 5.1 kb with laclQ inserts in both 
possible orientations were obtained; In pSEIII transcription of the 
laclQ' gene wasl;lockwise'inlh"el)u '~ 
BioTechniques 19 (1995). 196 ff.). This plasmid was present in the E. 
coli strain SCSI (Stratagene) used as the host strain for the cDNA 
expression library. 

Using a picking/gridding robot. 80.640 clones were picked into 384-well 
microtiter plates and gridded at high density onto nylon and 
polyvinylidene difluoride (PVDF) filters. Nylon filters were processed for 
DNA hybridizations (DNA filters), whereas PVDF filters were transferred 
onto agar plates containing IPTG for induction of protein expression and 
processed for protein detection (protein filters). 
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Example 2: Rearraying clones that express a recombinant insert 



The human fetal brain library constructed as described in Example 1 
was analyzed and 20% of the clones were identified giving a positive 
signal with the anti-His antibody. Their coordinates in the library in the 
microtiter plates were obtained, and these clones were rearrayed. now 
considered a new library, and grown as before. The clones were picked, 
gridded, and grown, and protein and DNA filters were made as 
described hereinabove. The protein filter was incubated with the anti- 
His antibody and 90-100% of the clones gave a positive signal. 

Example 3: Protein expression screening on high-density filters 

High-density protein filters of the hEx1 library were screened with the 
monoclonal RGS-His antibody recognizing the N-terminal sequence 
RGSH6 of recombinant fusion proteins overexpressed from the pQE- 
30NST vector. (Fig. 1). Approximately 20% of the clones were positive 
(signals of intensities 1, 2 or 3), classified one to three. These clones 
were considered putative protein expression clones (Fig. 1). The hEx1 
cDNA library was prepared from human fetal brain tissues by oligo (dT) 

priming- (Gubler- et- al.T- Gene- 2-5- (4 983)r-263)-using--a^ 

Plasmid System kit (Life Technologies). cDNA was size-fractionated by 
gel filtration and individual fractions were ligated between the Sal! and 
NotI sites of the expression vector pQE-30NST. E. coli SCSI 
(Stratagene) carrying the helper plasmid pSE111 was used as the host 
strain. After transformation by eiectroporation, the library was plated 
onto square agar plates (Nunc Bio Assay Dish) and grown at 37°C 
overnight. Using an automated robotic system (Lehrach et al, 
Interdisciplinary Science Reviews 22 (1997), 37-44), colonies were 
picked into 384-well microtiter plates (Genetix) filled with 2^YT medium 
containing 100 pg/ml ampicillin. 15 pg/ml kanamycin. 2% glucose and 
freezing mix (0.4 mM MgS04. 1.5 mM Na3-citrate. 6.8 mM (NH4)2S04. 
3.6% glycerol. 13 mM KH2P04. 27 mM K2HP04, [pH 7.0]). Bacteria 
were grown in the microtiter wells at 37^C overnight and replicated into 
new microtiter plates using 384-pin replicating tools (Genetix). All 
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copies were stored frozen at 80°C. 
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Example 4: Identification of genes and proteins on corresponding filter sets 

GAPDH and HSP90a were chosen as example proteins, with open 
reading frames of 1.008 bp and 35.922 Dalton for GAPDH (Swiss-Prot 
P04406) and 2.199 bp and 84.542 Dalton for HSP90a (Swiss-Prot 
P07900). 

A set of three high-density DNA filters (80.640 clones) of the hEx1 
library was screened with gene-specific cDNA probes. High-density 
filters were prepared by robot spotting, as described (Maier at al., Drug 
Discovery Today 2 (1994), 315-324; Lehrach et al.. Interdisciplinary 
Science Reviews 22 (1997), 37-44), Bacterial colonies were gridded 
onto Nylon membrane filters (Hybond N+, Amersham) for DNA analysis 
and on polyvinylidene difluoride (PVDF) membrane filters (Hybond- 
PVDF, Amersham) for protein analysis (filter format 222 mm x 222 
mm). Clones were spotted at a density of 27,648 clones per filter in a 
duplicate pattern, surrounding ink guide dots. High-density filters were 
placed onto square 2xYT agar plates (Nunc Bio Assay Dish) containing 
100 |jg/ml ampicillin. 15 yg/mi kanamycin and 2% glucose. 



Filters to be ijsed for DNA analysis were grown overnight at 37°C and 
subsequently processed as previously described (Hoheisel et al., J. 
Mo!. Biol. 220 (1991). 903-914). Filters for protein analysis were grown 
overnight at 30°C and subsequently then transferred onto agar plates 
supplemented with 1 mM IPTG to induce protein expression that was 
induced for 3 hours at 37^C. Expressed proteins were fixed on the filters 
by placing the filters onto blotting paper soaked in 0.5 M NaOH. 1.5 M 
NaCl for 10 minutes, twice for 5 minutes onto 1 M Tris-HCI. pH 7.5. 1.5 
M NaCI for 5 minutes and finally onto 2xSSC for 15 minutes. Filters 
were air-dried and stored at room temperature. 

DNA hybridizations using digoxigenin-labeled PGR probes and 
Attophos alkaline phosphatase substrate (JBL Scientific, San Luis 
Obispo) were performed as described (Maier et aL, J. Biotechnol. 35 
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(1994), 191-203). Digoxigenin-labeled hybridization probes were 
prepared by PCR-amplification of a clone containing the complete open 
reading frame of human GAPDH and of the IMAGE clone number 
343722 containing a C-terminai part of HSP90a (GenBank W69361). 

With a human GAPDH probe (Fig. 2a). 206 (0.26%) clones were 
positive (Table 1) (Fig. 2a). A second hybridization confirmed 202 and 
detected 35 additional clones (raising the total to 237, Table 1). 56 
(0.07%) clones were identified with a human HSP90a probe. On 
corresponding protein filters, 56 (27%) or 14 (25%) of GAPDH or 
HSP90a positive clones, respectively, were recognized by the RGS-His 
antibody. 

Antibody screening on high-density filters was performed as follows: a 
rabbit anti-GAPDH serum was affinity purified as described (Gu et al.. 
BioTechniques 17 (1994). 257-262). Anti-HSP90 (Transduction 
Laboratories, Lexington) is directed against amino acids 586 to 732 of 
HSP90a. Dry protein filters were soaked in ethanol. and bacterial debris 
was wiped off with paper towels in TBST-T (20 mM Tris-HCI, pH 7.5, 

0:5-M ■Nae!rO:O5%-Tween"2Or0:5%-Triton-X=1OO).-The- filters-were-- 

blocked for 1 hour in blocking buffer (3% non-fat, dry milk powder in 
TBS. 150 mM NaCI. 10 mM Tris-HCI, pH 7.5) and incubated overnight 
with 50 ng/ml anti-HSP90 antibody or the anti-GAPDH antibody, diluted 
1:5000. After two 10 minute washes in TBST-T and one in TBS, filters 
were incubated with alkaline phosphatase (AP)-conjugated secondary 
antibody for 1 hour. Following three 10 minute washes in TBST-T. one 
in TBS and one in AP buffer (1 mM MgC12. 0,1 M Tris-HCI. pH 9.5), 
filters were incubated in 0.5 mM Attophos (JBL Scientific. San Luis 
Obispo) in AP buffer for 10 minutes. Filters were illuminated with long- 
wave UV light, and a high-resolution CCD detection system was used 
for image generation (Maier et al.. Drug Discovery Today 2 (1997), 315- 
324). Positive clones were scored using custom-written image analysis 
software. With a polyclonal anti-GAPDH antibody (Fig. 2b). 39 clones 
were positive (Table 2). These were all detected by the RGS-His 
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antibody but only 32 clones scored positive with the GAPDH-specific 
DNA probe. However, 5 of the 7 unaccounted clones were detected in 
the second DNA hybridization. Screening with a monoclonal anti- 
HSP90 antibody yielded 32 positive clones. 28 of which were detected 
by the HSP90a DNA probe, and 10 were positive with both the HSP90a 
DNA probe and the RGS-His antibody. In a second anti-HSP90 
screening, 30 clones were confirmed, and 12 new clones were 
detected, which were all positive with the HSP90a DNA probe. 



Example 5: Sequence and Western blot analysis of detected clones 



Fig. 3 summarizes the filter data obtained for GAPDH and HSP90a. 
Clones from categories A-E were analyzed by sequencing the 5'-ends 
of their cDNA inserts (Fig. 4) and by western blotting (Fig. 5). The 
following experimental protocols were carried out. 



Ten GAPDH clones identified with the DNA probe, the anti-GAPDH and 
the RGS'His antibody were sequenced and found to contain GAPDH 
sequences in the correct reading frame. Nine clones expressed 
-recombinant Hise^tagged-proteins-spanning the-fuIhGAPDK sequence — 
plus 5'-UTR and vector-amino acids encoded amino acids by the 5- 
UTR of the mRNA and the vector. 

All ten clones positive with the HSP90a DNA probe, the RGS-His and 
the anti-HSP90 antibody had HSP90a. sequences in the correct reading 
frame. However, none of them accommodated the full coding region, 
and five clones were shown to express His6-tagged fusion proteins 
translated from differently sized C-termina! parts of the HSP90a 
sequence. 



Sequences of seven GAPDH clones negative with the specific-GAPDH 
antibody on filters were shown to overlap the GAPDH GenBank 
sequence. Two of these clones had inserts in the correct reading frame 
and expressed GAPDH fragments (24 kD) that were stained by the anti- 



(A) 



All-round positives 



(B) 



Specific antibody negatives 
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GAPDH antibody on western blots (Fig. 5a. B, lanes 11, 12). GAPDH 
inserts were in incorrect reading frames in the other five clones, 
suggesting expression of which supposedly expressed peptides in the 
range of 6.5- to 16.7 kD polypeptides (Fig. 5a, B, lanes 13-17). Signal 
intensities of these clones were generally low when probed with the 
RGS-His antibody on high-density filters. 

Three of four HSPQOa clones had inserts in an incorrect reading frame, 
and expressed short peptides not reactive with the anti-HSP90 antibody 
(two clones shown in Fig. 5b, lanes 6, 8). The remaining clone carried 
an insert in the correct reading frame gave a band of the calculated size 
(56.0 kD) on western blots (Fig. 5b, lane 7) and was detected by the 
anti-HSP90 antibody in a second high-density filter screening. 

(C) DNA probe-only positives 

Eleven out of twelve randomly selected GAPDH clones were shown to 
contained a GAPDH insert in an incorrect reading frame, supposedly 
expressing peptides in the range of 3.4 to 9.1 kD. Clone 
MPMGp800A1755 had an insert in the correct reading frame but carried 
— — - — — a-point-mutation-at positioners -in-the-5*-UTR7 leading io a- stop-codon — 
and a calculated 4.7 kD peptide. 

DNA sequence analysis indicated that eleven out of twelve HSP90a 
clones contained inserts in an incorrect reading frame and possibly 
expressed peptides of 2.8- to 5.4 kD calculated molecular mass. Only 
clone MPMGp800l131 15 had an insert in the correct reading frame, 
expressed a protein of 78.7 kD size and was positive in a second anti- 
HSP90 antibody screening. 

No false positives were found for the GAPDH or the HSP90a DNA 
probe. 

(D) DNA probe negatives 

Four GAPDH clones were shown to have correct inserts, representing 
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false negatives of the DNA probe but were detected in a second DNA 
hybridization experiment. Two clones contained sequences of human 
polyubiquitin (GenBank D53791) and human HZF10 (PIR S47072). 

All four HSPQOa clones expressed polypeptides detected on western 
blots (Fig. 5b, D). Clone MPMGp800G06207 (lane 12) contained an 
HSP90a insert carrying a 46 bp deletion and was obviously a false 
negative of the HSP90a DNA probe. The remaining three clones 
accommodated inserts with sequence homology to murine uterine- 
specific proline-rich acidic protein (GenBank U28486; lanes 9, 10) or 
identity to an EST sequence of unknown function (lane 1 1 ). 

(E) DNA probe and specific antibody positives (RGS-His negatives) 

Ten clones recognized by the HSP90a DNA probe and the anti-HSP90 
antibody but not by the RGS-His antibody, were sequenced and found 
to contained HSP90a sequences inserted in an incorrect reading frame. 
His6-tagged polypeptides expressed from these clones would have 
calculated masses of 3.2- to 6.1 kD and were not found in western blots 
(Fig 5b, E). In contrast, matching patterns of bands were observed with 
_____ ^^.^ anti-HSP90-antibody: 

Bacteria containing cDNA clones were grown by shaking in 2 ml 2xYT 
medium containing 100 pg/ml ampicillin, 15pg/ml kanamycin and 2% 
glucose. At an 0.0.600=0.4, IPTG was added to 1 mM final 
concentration, and the incubation was continued for 3 h at 37°C, Whole- 
cell protein extracts were subjected to 15% SDS-PAGE and stained 
with Coomassie blue, according to Laemmli (Laemmli, Nature 227 
(1970). 680-685) 

After SDS-PAGE, proteins were transferred onto PVDF membranes 
(Immobilon P. Millipore) in 20 mM Tris, 150 mM glycine. 0.1% SDS. 
10% methanol, using a semi-dry electrotransfer apparatus (Hoefer 
Pharmacia Biotech, San Francisco), according to the manufacturer's 
recommendations. 
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cDNA inserts were amplified by PGR using primers pQE65 (TGA GCG 
GAT AAC AAT TTC ACA GAG) and pQE276 (GGC AAG GGA GCG 
TTC TGA AC) at an annealing temperature of 65^C. PGR products were 
sequenced using dye-terminator cycle sequencing with the pQE65 
primer and ABI sequencers (Perkin Elmer) by the service department of 
our institute. 



Example 6: Vector constructs 

pQE-30NST (GenBank accession number AF074376) has been 
described (Bussow et al., Nucleic Acids Res. 26 (1998), 5007-5008). 
pQE-BH6 was constructed using the polymerase chain reaction (PGR) 
for insertion of an oligonucleotide encoding the protein sequence 
LNDIFEAQKIEW between MRGS and HIsq of pQE-30 (Qiagen). thereby 
separating the two parts of the RGS- Hise epitope. 



Example?: Antibodies 

Monoclonal antibodies of the following manufacturers were used at 
dilutions as indicated: mouse anti-RGS-His (QIAGEN, 1:2.000), mouse 

anti=rabbit-GAPDH~(Research-DiagnosticsHnc-"Clone-6e5;-1-:5;000); — 

mouse anti-HSP90 (Transduction Laboratories, clone 68, 1: 2,000). rat 
anti-alpha tubulin (Serotec Ltd.. clone YL1/2. 1:2.000). 
Secondary antibodies were F(ab')2 rabbit anti-mouse IgG HRP (Sigma) 
and F(ab')2 rabbit anti-rat IgG HRP (Serotec Ltd.). diluted 1:5.000. for 
the detection of mouse and rat monoclonals, respectively. 



Example 8: Large-scale protein expression and purification 

Proteins were expressed in E coli (strain SCSI) liquid cultures. 900 ml 
SB medium (12 g/l Bacto-tryptone, 24 g/1 yeast extract. 17 mM KH2PO4. 
72 mM K2HPO4. 0.4% (v/v) glycerol) containing 100 pg/ml ampicillin 
and 15 pg/ml kanamycin were inoculated with 10 ml of an overnight 
culture and shaken at 37°C until an ODeoo of 0.8 was reached. 
Isopropyl-b-D-thiogalactopyranosid (IPTG) was added to a final 
concentration of 1 mM. The culture was shaken for 3.5 h at 37°G and 
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cooled to 4°C on ice. Cells were harvested by centrifugation at 2,100 g 
for 10 min, resuspended in 100 ml Phosphate Buffer (50 mM NaHaPO^, 
0.3 M NaCI, pH 8.0) and centrifuged again. Cells were lysed in 3 ml per 
gram wet weight of Lysis Buffer (50 mM Tris. 300 mM NaCI, 0.1 mM 
EDTA. pH 8.0) containing 0.25 mg/ml lysozyme on ice for 30 min. DNA 
was sheared with an ultrasonic homogeniser (Sonifier 250, Branson 
Ultrasonics. Danbury. USA) for 3 x 1 min at 50% power on ice. The 
lysate was cleared by centrifugation at 10,000 g for 30 min. Ni-NTA 
agarose (Qiagen) was added and mixed by shaking at 4°C for 1 h. The 
mixture was poured into a column which was subsequently washed with 
ten bed volumes of Lysis Buffer containing 20 mM imidazole. Protein 
was eluted in Lysis Buffer containing 250 mM imidazole and was 
dialysed against TBS (10 mM Tris-HCI, 150 mM NaCI, pH 7.4) at 4*'C 
overnight. 

Example 9: High-throughput small-scale protein expression 

Proteins were expressed from selected clones of the arrayed human 
fetal brain cDNA expression library hExl (Bussow et al., Nucleic Acids 
Res. 26 (1998), 5007-5008). This library was directionally cloned in 
pQE-TO"NST^ fof TPTG-inducible" ex^r"essioh~~ of ~ Hise-faggecl^fusion' 
proteins. 96-well microtitre plates with 2 ml cavities (StoreBlock, 
Zinsser) were filled with 100 pi SB medium, supplemented with 100 
pg/ml ampicillin and 15 pl/ml kanamycin. Cultures were inoculated with 
E. coli SCSI cells from 384-well library plates (Genetix, Christchurch, 
U.K.) that had been stored at -80°C. For inoculation, replicating devices 
carrying 96 steel pins (length 6 cm) were used. After overnight growth 
at 37°C with vigorous shaking, 900 pi of prewarmed medium were 
added to the cultures, and incubation was continued for 1 h. For 
induction of protein expression. IPTG was added to a final 
concentration of 1 mM. All following steps, including centrifugations, 
were also done in 96-well format Cells were harvested by 
centrifugation at 1,900 g (3,400 rpm) for 10 min, washed by 
resuspension in Phosphate Buffer, centrifuged for 5 min and lysed by 
resuspension in 150 pi Buffer A (6 M Guanidinium-HCI, 0.1 M 
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NaH2P04, 0.01 M Tris-HCI. pH 8.0). Bacterial debris was pelleted by 
centrifugation at 4,000 rpm for 15 min. Supernatants were filtered 
through a 96-well filter plate containing a non-protein binding 0.65 pm 
pore size PVDF membrane (Durapore MADV N 65, Millipore) on a 
vacuum filtration manifold (Multiscreen, Millipore). 

Example 10: Automated filter spotting 

Pre-cut (25 x 75 mm) polyvinylidene difluoride (PVDF) filter strips 
(Immobilon P, Millipore) were soaked with 96 % ethanol and rinsed in 
distilled water for 1 min. Five wet filter strips were fixed with tape onto a 
230 x 230 mm plastic tray. The spotting was done by a motor-carried 
transfer stamp (Fig. 6) which can be positioned at a resolution of 5 pm 
in x-y-2 directions (Linear Drives, Basildon, UK). This allows densities of 
approximately 300 samples/cm^, spotted in a duplicate pattern. The 
transfer stamp accommodates 4x4 = 16 individually mounted, spring- 
loaded pins at 4.5 mm spacing. Since the spacing is compatible to the 
spacing of 384-well plates, this tool enables high-density spotting out of 
384-well microtitre plates. The size of the blunt-end tip of the stainless 
steel pins is 250 pm. Prior to each transfer, the spotting gadget was 
washedHn-a-30 yo-ethanol bath and-subsequently^dried-with-a-fan^to- 
prevent cross contamination. For the experiments shown here, 4x4 
patterns were spotted with each pin. Each pattern contains four ink 
guide spots surrounded by six samples spotted in duplicate (12 sample 
spots in total, Fig. 7A). Each spot was loaded five times with the same 
protein sample (5 nl each). Having adjusted the spotting height in 
advance, the spotting of 96 samples took approximately 20 min for the 
generation of five identical protein microarrays. 

Example 11:Antibody detection and image analysis 

After spotting, filters were soaked in ethanol for 1 min, rinsed in distilled 
water, washed in TBST (TBS, 0.1% Tween 20) for 1 min, blocked in 2% 
bovine serum albumin (BSA)/TBST for 60 min and incubated with 
monoclonal antibodies in 2% BSA/TBST for 1 h at room temperature, 
followed by two 10 min TBST washes and 1 h incubation with 
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secondary antibodies in 2% BSA/TBST. Subsequently, filters were 
washed in 20 nnl TEST overnight, incubated in 2 ml CN/DAB solution 
(Pierce) for 1-10 min, and positive reactions were detected as black 
spots. Images were acquired with a cooled CCD Camera (Fuji LHS, 
Raytest, Germany). Pictures were taken through a Fujinon objective (f: 
0.8. 50 mm) with an Integration time of 20 ms. Image analysis was done 
with the AIDA package (Raytest, Germany) for spot recognition and 
quantification. The resulting spot values were transferred to an Excel 
spreadsheet (Microsoft. USA) to display the diagrammes of Figs. 7. 8 
and 9. 



Example 12; Fabrication of protein microarrays 

Proteins were expressed in liquid bacterial cultures, and solutions were 
spotted onto PVDF filters, either as crude lysates or after purification by 
Ni-NTA immobilised metal affinity chromatography (IMAC) (Hochuli et 
al. J. Chromatography. 411 (1987). 177-184). PVDF filter membranes 
were used for their superior protein binding capacity and mechanical 
strength (compared to nitrocellulose) and satisfactory former 
performance (Bussow et al.. Nucleic Acids Res. 26 (1998). 5007-5008). 
The~new transfer~stamp^(Fig: "6)~consists "of"pins"with 250~pm-tip^si2e7 
which is nearly half the size of the 450 pm pins that have previously 
been used for the generation of in situ protein expression filters 
(Bussow et al.. Nucleic Acids Res. 26 (1998). 5007-5008). Although 
Figs. 7, 8 and 9. as our first test results, show about the same spotting 
density as our in situ filters, the smaller pin tip diameter enables higher 
spotting densities. While an in situ filter of 222 x 222 mm 
accommodates 27,648 clones (5x5 duplicate spotting pattern with one 
guide spot), more than 100,000 samples could be placed onto the same 
area using the new transfer stamp. This allows a substantial reduction 
in total array size to a convenient microscopic slide format (25 x 75 mm 
holding 4,800 samples, corresponding to 2.400 duplicates). The 
miniaturised set-up allows a very economic use and high concentrations 
of reagents in incubating solutions as a much smaller buffer volume is 
needed to cover the filters. In contrast to in situ filters, the signals 
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obtained on microarrays are sharp and well localised. As the next step 
towards the fabrication of protein chips, we envisage a further increase 
in density by using high-speed picolitre spotting (inkjetting) onto 
modified glass . surfaces. Alternative approaches to protein microarrays 
have been reported using either photolithography of silane monolayers 
(Mooney et aL. Proc. Natl. Acad. Sci. USA. 93 (1996). 12287-12291) or 
inkjetting onto polystyrene film (Ekins, Clin. Chem. 44 (1998), 2015- 
2030; Silzel et aL, Clin. Chem. 44 (1998), 2036-2043). In contrast to our 
library spotting technology, those advances have been focused on the 
fabrication of miniaturised immunoassay formats by patterning of single 
proteins (e.g.. BSA. avidin oranti-IgG monoclonal antibodies). 

Example 13: Sensitivity of specific protein detection 

The sensitivity of specific protein detection on microarrays was 
assessed by spotting different concentrations of three purified proteins, 
human glyceraldehyde-3-phosphate dehydrogenase (GAPDH, Swiss- 
Prot P04406). a C-terminal fragment (40.3 kd) of human heat shock 
protein 90 alpha (HSP90aIpha. Swiss-Prot P07900) and rat 
immunoglobulin heavy chain binding protein (BIP, Swiss-Prot P06761). 

Microarrays*~were^ subsequently^incubated^with ~a^ monoclonal- anti-- 

GAPDH antibody, rabbit anti-mouse IgG HRP and HRP substrate 
CN/DAB (Fig. 7A). The sensitivity of detection, as the lowest 
concentration that delivered clearly visible, specific spots above 
background (detection threshold), was calculated to be 10 fmol/pl. 
corresponding to 250 attomol or 10 pg of GAPDH in 5 x 5 nl spotted 
(Fig. 78). 

Example 14:High-throughput screening for protein expression 

Crude lysates of 92 clones of the arrayed human fetal brain cDNA 
library hExl (Bussow et a!.. Nucleic Acids Res. 26 (1998), 5007-5008). 
previously identified as protein expressors by the monoclonal antibody 
RGS-His (Qiagen) on in situ filters, were spotted in duplicate, alongside 
with 4 control samples and ink guide spots. Microarrays were screened 
for expression of RGS-HtSe-tagged fusion proteins using the same 
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antibody (Fig. 8A, insert). When relative intensities of duplicates (see 
Fig. 7A) are plotted against each other, the resulting diagonal indicates 
a good reproducibility of the detection nnethod (Fig. 8A). Therefore, 
means of duplicates were plotted for all 96 samples, and an arbitrary 
specificity threshold for identification of positives was set to 7»500 
relative intensity (Fig. 8B). Under these conditions, a negative control 
(H1, vector pQE-30NST without insert) was clearly negative (1,500 
relative intensity), as was an HSP90alpha clone, featuring a divided 
RGS-Hise epitope (H2, vector pQE-BH6; 0 relative intensity). The lysate 
of an RGS-Hise-tagged GAPDH clone (H3. vector pQE-30NST) was 
used as a positive control and delivered a signal of 21 .000 relative 
intensity. The clearly positive result (15.000 relative intensity) obtained 
with a rat BIP clone (H4, vector pQE-BH6) is surprising because this 
clone also features a divided RGS-Hise epitope. The reactivity might be 
explained by partial re-constitution of the RGS-Hise epitope due to 
conformational characteristics of BIP. 

The cDNA inserts of 54 of the 92 putative hExl expression clones show 
homology to Genbank entries of human genes (Bussow, Thesis, 
Department of Chemistry, Free University Beriin (1998)). These inserts 
were 'ch ecked" for' their' reading "f ra mes ~( RF)~in~ relation "to~the~vector-~ 
encoded RGS-Hise tag sequence. 34 inserts (63%) were found to be 
cloned in the correct reading frame (RF+), while 20 (37%) were in an 
incorrect reading frame (RF-). hence those clones could not be 
expected to express the predicted protein. However, all 92 clones were 
originally selected as protein expressors on in situ filters due to cleariy 
positive signals with the monoclonal antibody RGS-His [intensity levels 
2 and 3, (Bussow, Thesis, Department of Chemistry, Free University 
Beriin (1998))]. On microarrays. the number of incorrect reading frame 
clones identified as protein expressors was decreased by 70%, as only 
6 RF(-) clones were still confirmed as positives (Fig. 8B). This indicates 
that the new microarray technology is a major advancement over in situ 
filters for its superior ability to exclude incorrect reading frame clones. 
On the other hand, only one RF(+) clone was clearly below the 
specificity threshold and would have been missed in this screen, 
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probably due to an insufficient amount of protein expressed in the 
microtitre well. This stresses again the nature of our approach that is 
exclusively based on "positives" to be confirmed by sequencing and/or 
protein characterisation (Bussow et al.. Nucleic Acids Res. 26 (1998). 
5007-5008). 

In summary, the high-throughput protein expression screening on 
microarrays resulted in a false negative rate of under 2% (1 undetected 
RF(+) clone per 54 clones total). The rate of false positive clones, 
expressing proteins in incorrect reading frames, was down to 11%, 
compared to 37% on in situ filters (Bussow. Thesis, Department of 
Chemistry, Free University Berlin (1998). That makes protein 
microarrays an economical tool for very sensitive protein expression 
screening. 

Example 15: Antibody specificity screening 

Protein microarrays featuring the same test set of 92 hExl expression 
clones and 4 controls (see above) were screened for the human 
proteins GAPDH. HSP90alpha and alpha tubulin using monoclonal 
antibodies. While the anti-GAPDH antibody detected its target antigen 

" exclusiveTy'(H37 Figr9A)'. anti-HSP90alpRFpfefel^tiany^fe^^ 

target antigen (H2. Fig. 9B) but showed some cross-reactivity with at 
least two other clones (H10, 60S ribosomal protein L18A and H3, 
GAPDH). Antibody cross-reactivity was even more pronounced in the 
anti-alpha tubulin screen (Fig. 9C). While the two RF(+) alpha tubulin 
clones in the test set (F9 and A4) were specifically recognised and the 
only RF(-) clone (C7) was left undetected, nine other clones showed 
anti-alpha tubulin reactivity above the arbitrary specificity threshold. 
Two of these clones (B1 and B12) represent unknown genes, and G1 is 
an RF(-) beta tubulin clone. H3 is the GAPDH positive control clone of 
Fig. 8 (see above), which to some extent seems to cross-react 
unspecifically (Figs. 98 and 9C), possibly due to an exceptionally high 
level of protein expression. Surprisingly, all other (five) clones above 
threshold express ribosomal proteins in a correct reading frame (E5. 
RPL3; H10. RPL18A; E6 and D8. RPS2; F7. RPS3A). Only one 
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additional ribosomal protein in the test set (E3, RPS25) did not show an 
anti-alpha tubulin reactivity. The epitope recognised by the anti-alpha 
tubulin antibody (YL1/2, (Kilmartin et aL, J. Cell Biol. 93 (1982), 576- 
582)) was identified as the linear sequence spanning the carboxy- 
terminal residues of tyrosinated alpha tubulin (Wehland et al., EMBO J. 
3 (1984), 1295-1300). According to those authors, the nninimal 
sequence requirements, as defined by dipeptide studies, are a 
negatively charged side chain in the penultimate position followed by an 
aromatic residue which must carry the free carboxylate group. As none 
of the cross-reacting ribosomal proteins on our microarrays fulfill these 
requirements, other (e.g., stojctural) epitopes might mimic the antigenic 
specificity. 
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3 
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CLAIMS 

A method for the selection of clones of an expression library that express 
inserts comprising the steps of 

(a) analyzing for the expression of at least one detectable (poly)peptide 
expressed as a fusion protein with an expression product of a 
recombinant insert of a clone of said expression library, the clones of 
said expression library being arranged in arrayed form; and 

(b) rearraying clones in which expression of said at least one (poly)peptide 
is detected. 

The method of claim 1 further comprising the following steps 

(c) contacting a ligand specifically interacting with a (poiy)peptide 
expressed by the insert of a clone conferring a desired biological 
property with said library or a first replica of said rearrayed library of 
clones and analyzing said library of clones for the occurrence of an 
interaction; and/or 

(d) carrying out a hybridization or an oligonucleotide fingerprint with a 
nucleic acid probe specific for the insert of a clone conferring said 
desired biological property with said library or said first replica or a 
second replica of said library of rearrayed clones and analyzing said 
library of clones for the occurrence of a hybridization; and 

(e) identifying and/or characterizing clones wherein an expression of the at 
least one (poly)peptide in step (a) and/or an interaction in step (c) 
and/or a specific hybridization or an oligonucleotide fingerprint in step 
(d) can be detected. 

The method of claim 1 or 2, wherein said (poly)peptide expressed as a part of 
a fusion protein with said expression product of said recombinant insert is an 
antibody or a fragment or derivative thereof, a tag, an enzyme, or a phage 
protein or fragment thereof, or a fusion protein. 



PCT/EP99/02964 



wo 99/57312 PCT/EP99/02964 

41 

4. The method of any one of claims 1 to 3, wherein said analysis for the 
expression of a (poly)peptide in step (a) is effected by contacting a ligand 
different from the ligand of step (c) that specifically interacts with said 
(poly)peptide and analyzing said library of clones for a specific interaction to 
occur. 

5. The method of any one of claims 2 to 4. wherein said desired biological 
property is specificity for a cell, a tissue, or the developmental stage of a cell 
or a tissue, a microorganism, preferably a bacterium, a plant or an organism. 

6. The method of claim 5, wherein said cell or tissue is a normal cell or tissue, a 
diseased cell or tissue, or a pretreated cell or tissue. 

7. The method of any one of claims 1 to 6, wherein said clones are bacterial 
transformants, recombinant phage, transformed mammalian, insect, fungal, 
yeast or plant cells. 

8. The method of any one of claims 1 to 7, wherein said arrayed form has 
substantially the same format in steps (a) to (d). 

9. The method of any one of claims 1 to 8, wherein said arrayed form is a grid 
form. 

10. The method of claim 9, wherein said grid has the dimensions of a microtiter 
plate, a silica wafer, a chip, a mass spectrometry target or a matrix. 

11. The method of any one of claims 1 to 10, wherein said clones are affixed to a 
solid support. 

12. The method of claim 11, wherein said solid support is a filter, a membrane, a 
magnetic bead, a silica wafer, glass, metal, a chip, a mass spectrometry target 
or a matrix. 
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13. The method of any one of claims 2 to 12. wherein at least one of said ligands 
is a (poly)peptide, a phage or a fragment thereof, blood, serum, a toxin, an 
inhibitor, a drug or a drug candidate, a non-proteinaceous or partially 
proteinaceous receptor, a catalytic polymer, an enzyme, a nucleic acid, a 
PNA, a virus or a part thereof, a cell or a part thereof, an inorganic compound, 
a conjugate, a dye. a tissue or a conjugate of said ligand. 

14. The method of claim 13, wherein said (poly)peptide is an antibody or a 
fragment or derivative thereof, a hormone or a fragment thereof or an enzyme 
or a fragment or derivative thereof. 

15. The method of any one of claims 2 to 14, wherein said interaction in step (c) is 
a specific interaction. 

16. The method of any one of claims 2 to 14, wherein said interaction in step (c) is 
an unspecific interaction. 

17. The method of any one of claims 2 to 16, wherein said hybridization in step (d) 
occurs under stringent conditions. 



18. The method of any one of claims 2 to 16, wherein said hybridization in step (d) 
occurs under non-stringent conditions. 

19. The method of any one of claims 3 to 18, wherein said tag is c-myc, His-tag, 
FLAG, alkaline phosphatase, EpiTag"^^, V5 tag, T7 tag. Xpress^^" tag or Strep- 
tag, a fusion protein, preferably GST, cellulose binding domain, green 
fluorescent protein, maltose binding protein or iacZ. 

20. The method of any one of claims 1 to 19, wherein said library of clones 
comprises a cDNA library. 

21. The method of any one of claims 1 to 20. wherein said arrayed form of said 
library and/or said replicas is/are generated by an automated device. 
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22. The method of claim 21, wherein said automated device is a picking robot 
and/or spotting robot and/or gridding robot. 

23. The method of any one of claims 1 to 22, wherein said analysis for the 
expression of a (poly)peptide in step (a) is effected by visual means, 
preferably mass spectrometry. 

24. The method of any one of claims 1 to 23 further comprising sequencing the 
nucleic acid insert of said desired clone. 

25. The method of any one of claims 1 to 23 further comprising identifying and/or 
characterizing the (poly)peptide encoded by the insert of the desired clone. 

26. A method for producing a pharmaceutical composition comprising formulating 
the insert, optionally comprised in a vector or the expression product of an 
insert of a desired clone conferring a desired biological property, said insert or 
expression product being identified and/or characterized in accordance with 
the method of any one of claims 1 to 25. 

277 A~ph^rm^"euticarcomposition^^ 

28. Kit comprising at least two replicas of expression libraries as defined in any 
one of claims 1 to 23 affixed to a solid support. 
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